TESI DOCTORAL Decision Threshold Estimation and Model Quality Evaluation Techniques for Speaker Verification
نویسنده
چکیده
The number of biometric applications has increased a lot in the last few years. In this context, the automatic person recognition by some physical traits like fingerprints, face, voice or iris, plays an important role. Users demand this type of applications every time more and the technology seems already mature. People look for security, low cost and accuracy but, at the same time, there are many other factors in connection with biometric applications that are growing in importance. Intrusiveness is undoubtedly a burning factor to decide about the biometrics we will used for our application. At this point, one can realize about the suitability of speaker recognition because voice is the natural way of communicating, can be remotely used and provides a low cost. Automatic speaker recognition is commonly used in telephonic applications although it can also be used in physical access control or in forensics. Speaker verification and speaker identification have several stages. First of all, one can find the parameterization stage of the voice signal, where the signal is processed to be modeled or compared. After that, we find the model estimation if we are training or the decision stage if we are making a comparison. This PhD is focused on the training and the decision stages of a speaker verification system. In these kind of systems, the result of the comparison between a utterance and a model depends on the decision threshold. The speaker is accepted if the obtained score is above the threshold and rejected if below. On the other hand, the quality of the utterances used to train the model will have a high influence on the performance. The way of detecting low quality utterances is also studied in this PhD. In real applications, it is common to have only a few data to estimate the model and the decision threshold. Furthermore, the non-availability of impostor material is also a negative aspect. The lack of data makes that low quality utterances or background noises have a great impact on performance. In this PhD, a new speaker-dependent threshold estimation method based only on client data and a method to detect outliers are introduced. Furthermore, new quality evaluation methods are also proposed. One interesting way of determining the quality of the utterances consists of detecting quality on-line, during training. By using this method, new quality utterances from the same speaker can be automatically replaced, in the same …
منابع مشابه
Robust methods of updating model and a priori threshold in speaker verification
We describe a method of updating a hidden Markov model (HMM) for speaker verification using a small amount of new data for each speaker. The HMM is updated by adapting the model parameters to the new data by maximum a posteriori (MAP) estimation. The initial values of the a priori parameters in MAP estimation are set using training speech used for first creating a speaker HMM. We also present a...
متن کاملTowards better making a decision in speaker verification
Speaker veri!cation is a process that accepts or rejects the identity claim of a speaker. How to make a decision is a critical problem; a threshold for decision-making critically determines performance of a speaker veri!cation system. Traditional threshold estimation methods take only information conveyed by training data into consideration and, to a great extent, do not relate it to production...
متن کاملExploiting GMM-based Quality Measure for SVM Speaker Verification
In this paper, we examine the problem of quality measurement for speaker verification using support vector machines (SVMs). An efficient Gaussian mixture models (GMMs) based quality estimation algorithm is proposed to potentially utilize speaker-specific broad acoustic-class characteristics. Some verification strategies are also considered in the test phase. We perform clustering-based vector p...
متن کاملEvaluation of speech quality measures for the purpose of speaker verification
Real-world deployment of speaker verification systems often have to contend with degraded signal quality and erratic statistical behaviour of the speech data being modelled. We present signal quality estimation techniques for extraction of additional information about the speech data that can be used to improve performance of speaker verification systems in degraded conditions. We propose metho...
متن کاملBayesian bpproach based decision in speaker verification
Considering Bayesian decision framework applied in the context of speaker verification, this paper presents a new way of handling troublesome anti-speaker model by proposing a redefinition of hypotheses involved in the classical statistical hypothesis test. This new definition of hypotheses is then implemented through a speaker independent normalization technique, named MAP approach. Besides su...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006